Stochastic models for horizontal gene transfer: taking a random walk through tree space.

Author

  • Marc A Suchard
Abstract

Horizontal gene transfer (HGT) plays a critical role in evolution across all domains of life with important biological and medical implications. I propose a simple class of stochastic models to examine HGT using multiple orthologous gene alignments. The models function in a hierarchical phylogenetic framework. The top level of the hierarchy is based on a random walk process in "tree space" that allows for the development of a joint probabilistic distribution over multiple gene trees and an unknown, but estimable species tree. I consider two general forms of random walks. The first form is derived from the subtree prune and regraft (SPR) operator that mirrors the observed effects that HGT has on inferred trees. The second form is based on walks over complete graphs and offers numerically tractable solutions for an increasing number of taxa. The bottom level of the hierarchy utilizes standard phylogenetic models to reconstruct gene trees given multiple gene alignments conditional on the random walk process. I develop a well-mixing Markov chain Monte Carlo algorithm to fit the models in a Bayesian framework. I demonstrate the flexibility of these stochastic models to test competing ideas about HGT by examining the complexity hypothesis. Using 144 orthologous gene alignments from six prokaryotes previously collected and analyzed, Bayesian model selection finds support for (1) the SPR model over the alternative form, (2) the 16S rRNA reconstruction as the most likely species tree, and (3) increased HGT of operational genes compared to informational genes.
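
As a rough illustration of the second form of random walk mentioned in the abstract (walks over complete graphs), the following Python sketch assumes a simple discrete-time walk in which every tree topology is a single uniform step away from every other topology. The discrete-time assumption, the restriction to unrooted binary topologies, and all function names are illustrative choices; the abstract does not give the paper's actual parameterization. Under these assumptions the probability that a gene tree still matches the species tree after a given number of steps has a closed form, which is one reason a complete-graph walk can stay tractable as the number of taxa grows.

```python
from math import prod


def num_unrooted_topologies(n_taxa: int) -> int:
    """Count unrooted binary tree topologies on n_taxa labeled leaves, (2n - 5)!!."""
    if n_taxa < 3:
        return 1
    # Product of the odd numbers 1, 3, ..., 2n - 5.
    return prod(range(1, 2 * n_taxa - 4, 2))


def return_probability(n_trees: int, steps: int) -> float:
    """Closed-form probability of sitting on the starting tree after `steps` moves.

    Assumes a simple random walk on the complete graph with n_trees vertices:
    each move jumps to one of the other n_trees - 1 trees uniformly at random.
    """
    decay = (-1.0 / (n_trees - 1)) ** steps
    return 1.0 / n_trees + (1.0 - 1.0 / n_trees) * decay


def return_probability_iterated(n_trees: int, steps: int) -> float:
    """Same quantity, obtained by propagating the distribution step by step.

    By symmetry only two numbers are tracked: the probability of the starting
    tree and the probability of any one other tree.
    """
    at_start, at_other = 1.0, 0.0
    for _ in range(steps):
        # The start is reached from each of the n - 1 other trees with prob 1/(n - 1).
        new_start = at_other
        # A given other tree is reached from the start or from the n - 2 remaining trees.
        new_other = (at_start + (n_trees - 2) * at_other) / (n_trees - 1)
        at_start, at_other = new_start, new_other
    return at_start


if __name__ == "__main__":
    n_trees = num_unrooted_topologies(6)  # six taxa, as in the abstract's data set
    print(n_trees)  # 105
    for k in range(5):
        print(k, return_probability(n_trees, k), return_probability_iterated(n_trees, k))
```

The iterated version is included only as a check on the closed form; both converge to 1 / n_trees, the uniform distribution over topologies.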

Similar resources

Stochastic Analysis of Seepage through Natural Alluvial Deposits Considering Mechanical Anisotropy

The soil is a heterogeneous and anisotropic medium. Hydraulic conductivity, an intrinsic property of natural alluvial deposits, varies both deterministically and randomly in space and has different values in various directions. In the present study, the permeability of natural deposits and its influence on the seepage flow through a natural alluvial deposit is studied. The 2D Finite Difference c...


Discrete random walk models for symmetric Lévy-Feller diffusion processes

We propose a variety of models of random walk, discrete in space and time, suitable for simulating stable random variables of arbitrary index α (0 < α ≤ 2), in the symmetric case. We show that by properly scaled transition to vanishing space and time steps our random walk models converge to the corresponding continuous Markovian stochastic processes, that we refer to as Lévy-Feller diffusion pr...


A PRELUDE TO THE THEORY OF RANDOM WALKS IN RANDOM ENVIRONMENTS

A random walk on a lattice is one of the most fundamental models in probability theory. When the random walk is inhomogeneous and its inhomogeneity comes from an ergodic stationary process, the walk is called a random walk in a random environment (RWRE). The basic questions such as the law of large numbers (LLN), the central limit theorem (CLT), and the large deviation principle (LDP) are ...


A survey on random walk-based stochastic modeling in eukaryotic cell migration with emphasis on its application in cancer

Impairments in cell migration processes may cause various diseases, among which cancer cell metastasis, tumor angiogenesis, and the inability of immune cells to infiltrate into tumors are prominent ones. Mathematical modeling has been widely used to analyze the cell migration process. Cell migration is a complicated process and requires statistical methods such as random walk for proper analys...


Journal title:
  • Genetics

Volume 170, Issue 1

Pages -

Publication date: 2005